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Abstract 

This paper considers the problem of rate function identification for multidimensional queue- 
ing models with feedback. A set of techniques are introduced which allow this identification 
when the model possesses certain structural properties. The main tools used are representation 
formulas for exponential integrals, weak convergence methods, and the regularity properties 
of associated Skorokhod Problems. Two examples are treated as special cases of the general 
theory: the classical Jackson network and a model for processor sharing. 

1 Introduction 

Although there has been considerable interest in establishing a theory of large deviations for queue- 
ing networks and related systems [17], there are few general results for multidimensional systems 
with feedback, save theorems which establish the existence of a large deviation principle but fail 
to provide an explicit formula for the rate function [3]. In this paper we introduce techniques 
that allow one to fill in this gap, at least for families of networks that possess certain structural 
properties. The main tools we use are the representation formulas used to prove existence in [3], 
weak convergence methods, and the regularity properties of an associated Skorokhod Problem. 

We begin with a review of the literature that deals with large deviation properties of queueing 
networks. One of the first papers on the topic is [8], which establishes certain large deviation 
properties of Jackson networks via nonlinear PDE techniques. Unfortunately, the methods of this 
paper do not extend easily to more general situations. A probabilistic method is used to prove 
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large deviation upper bounds for a fairly general class of Markov models in [6]. The corresponding 
lower bound is not proved, and so the tightness of these upper bounds remains an open question, 
although a partial answer will be given in the present work. 

A number of techniques have been developed that are very much tailored to particular models. 
For example, the special class of tandem queues is quite tractable, in large part because the absence 
of feedback means that continuous mapping methods can be applied. Indeed, in this case one can 
represent the queueing model as the composition of the mapping on path space defined by a suitable 
Skorokhod Problem and an unconstrained process [7]. When such a representation is available the 
large deviations analysis can be carried out by applying the contraction principle. Results for 
models of this type can be found in [2, 14, 18]. Although this method can be extended to cover a 
broader class of models (e.g., feedforward as in [13, 15]), it breaks down when feedback is present. 
A general result on large deviations for processes whose statistical behavior can be discontinuous 
across a smooth (n — l)-dimensional interface in IR ra was proved in [5], and then applied in [11] to 
prove the large deviation principle for a general class of stable two dimensional queueing models. 
Related results that also rely on a reduction to what are essentially one dimensional problems 
include [1, 19]. 

The only existing theory that does not make significant use of model specific geometric features 
is presented in [3], which considers a general class of jump Markov processes that model queueing 
systems. This paper proves the existence of a large deviation principle, and also provides a charac- 
terization of the rate function. The paper falls short, however, in that it does not identify the rate 
function. 

As the previous discussion suggests, no general method has emerged for rate function iden- 
tification when there is feedback and if the dimension is greater than two. On the other hand, 
an explicit expression for an upper large deviation rate function may be found in [6]. As we will 
see below, for many models the upper bound proved in [6] is actually tight, and thus the main 
difficulty appears in the proof of the large deviation lower bound. There are several reasons for this 
difficulty. One has to do with the fact that queueing systems fall into the category of "processes 
with discontinuous statistics," as defined in [5, 6]. The discontinuities appear because the generator 
of a queueing process often changes abruptly when one or more of the queues becomes empty. 

Before discussing in detail how these discontinuities affect the analysis, it is important to note 
that our proofs will actually be based on a control theoretic representation for the large deviation 
probabilities. This approach has much in common with the change of measure argument often used 
to prove large deviation lower bounds, and in fact each control will correspond to one such change 
of measure. Since the change of measure argument is more widely known in this context, we will 
use the terminology of this technique instead, with an understanding that the there is an equivalent 
phrasing in terms of control representations. 

Thus we resume our consideration of how one may establish a large deviation lower bound, 
and assume that a change of measure has been selected for this purpose. For processes with 
discontinuous statistics it turns out that one must characterize certain properties of the asymptotic 
fractions of time that the process spends in each subregion of smooth statistical behavior. The 
situation is simplest when one can uniquely characterize the fractions of time themselves. However, 
this is not usually possible if the dimension is greater than two, and it is this difficulty which has 
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thwarted a probabilistic proof of even the (relatively) simple case of Jackson networks. 

A method for eliminating this difficulty is one of our main innovations. We show, for the models 
introduced later in the paper, that an argument based on Jensen's inequality allows one to restrict 
the changes of measure that one must consider. In particular, it turns out that given a network 
from among the classes we consider (e.g., a Jackson network), one can restrict to changes of measure 
that return the process to this same class. This is extremely useful when analyzing the asymptotic 
properties of the new process, and the reason has to do with the second main new ingredient we 
use: weak convergence methods and the Skorokhod Problem. For the types of models we treat, 
it is known that under the standard large deviation scaling (which is the same as the law of large 
numbers (LLN) scaling), limits of the queueing system can be characterized as the unique solution 
of a well behaved Skorokhod Problem [9]. Recall that the use of Jensen's inequality discussed above 
allows one to restrict, a priori, to changes of measure that define processes of the same sort as the 
original process. Because they fall into the same class as the original model, the LLN limit of each 
model defined by one of these changes of measure will be characterized by a well behaved SP (which 
is typically not the SP associated with the original model). This is rather convenient, since the 
uniqueness of solutions to a well behaved SP is just what is needed to properly characterize the 
limiting behavior of the process. 

Besides a well behaved Skorokhod Problem, an additional structural property that is needed is 
related to the form of the rate function. As noted previously, the formulation of the rate function 
involves quantities that may be interpreted as limits of fractions of time spent in different subregions 
of smooth statistical behavior. In general, the limits of the fractions of time are not uniquely 
characterized, since there are too few constraints when compared to the number of subregions of 
smooth behavior. However, in the course of applying Jensen's inequality we derive an alternative 
representation for the rate function. This representation involves only functionals of the fractions 
of time that are uniquely characterized, and thus weak convergence methods can be applied to 
prove the desired convergence. 

As an applications of these methods we identify the rate function for two families of continuous 
time jump Markov models. However, the range of applications is somewhat broader, and for 
example one can consider also Markov modulated jump rates, discrete time processes, and other 
variations. The most significant restriction appears to be the requirement that the associated SPs 
must all be regular. 

An outline of the paper is as follows. In Section 2 notation is introduced and the needed 
background material is presented. It is convenient to introduce "localized" versions of the original 
queueing model. These models provide, for a given point in the state space of the original model, 
the simplest model whose large deviations behavior is the same as that of the original model near 
the given point. We then state the control representation for the rate function for the localized 
models, and introduce the Skorokhod Problem, which will be used to identify weak limits in the 
asymptotic analysis of the representation formula. In Section 3 we relate various representations 
for the rate function for the local model, which will eventually appear as the integrand in the rate 
function for the full model. It is in this section that we isolate key parameters in the rate function 
that are needed for its identification. In Section 4 a number of abstract assumptions are made 
regarding these parameters and other properties of the network, and weak convergence methods 
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are applied to complete the identification of the rate function. Finally, in Section 5 we verify the 
assumptions for two families of models. Some minor details are relegated to an Appendix to ease 
the exposition. 

2 Background 

2.1 Stochastic control representation and statement of the LDP 

This subsection summarizes some definitions and results of [3]. Under appropriate assumptions, the 
large deviation principle holds for a general class of jump Markov processes, and the rate function 
may be characterized in terms of rate functions for local models, as explained below (Theorems 2, 
3). The proof of the large deviation principle is based on a control-theoretic representation for the 
probabilities that these processes stay within tubes centered at piecewise linear paths (Theorem 1). 

For T = [0, t] or T = [0, oo) and a Polish space S, we denote by D(T : S) the space of all cadlag 
functions T t— > S. The process used to model the queueing system is a jump Markov process on 
the orthant 7/.) \.r 77 x : (x,e t ) > 0,i = 1,2,..., N}, with paths in D([0,oo) : (In [3] 

more general polyhedra than the orthant are allowed.) The analysis in [3] relies on the introduction 
of local models and the stochastic processes associated with them. These processes take values in 
subsets of Z5 N of the form 77. Xix , where K C {1, 2, . . . , N}, and 

77 Xh = {x e % N : (x, ei) > 0, i e K}. 

Thus for a local model associated with the state space , non-negativity constraints are enforced 

on Xi, i G K, but not otherwise. The original queueing model is associated with ^^_^ l, --- ,N ^ = y/ x _ 
The term full model will be used to distinguish the original queueing model from its related family 
of local models. We denote also 

IR+ = {x e 1R N : (x, e t -> > 0, i = 1, 2, . . . , N}, 

n^' K = { x e n N : (x, ei) > o, i e k}. 

As will be discussed further below, the large deviation behavior of a queueing model can often 
be determined by considering the asymptotics of the probability that the process stays in a small 
neighborhood of a constant velocity trajectory, if we allow the initial position and velocity of the 
trajectory to range over suitable values. Suppose, for example, that the trajectory is y+fit, t £ [0, 1]. 
If the initial point y or terminal point y + (3 of the trajectory satisfy more active constraints of 
the form (e 4 -, x) = than the "interior" of the trajectory {x = y + fit : t G (0, 1)}, then the more 
complicated dynamics in the neighborhood of such an end point are an annoying but relatively 
unimportant nuisance. See Figure 1. 

The appropriate local model will effectively throw away all parts of the process that are unim- 
portant when determining large deviation properties of the full model with regard to such constant 
velocity trajectories. For more details, see the discussion in [3, Section 4] and Example 1 below. 
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Figure 1: A trajectory whose endpoints satisfy more constraints than the interior 

The notation introduced in a moment will be used in the context of both the original and the local 
models. 

We first introduce the notion of a facet. Facets will be the regions of constant statistical 
behavior, and they will vary with the local model under consideration. For K C {1, 2, . . ., N} and 
I C K, let 

T K ^ = {x G U N : (x, e-) = 0, j G J, (x, e t ) > 0, i G K \ I}. 

Recall that for a local model associated with K, only the variables Xi,i G K are constrained to be 
non-negative. Facets are characterized by the subset of indices / for which the inequality constraint 
is tight. For each local model it will be necessary to group those states for which the generator of 
the process takes the same form. Each such group will coincide with the intersection of Z5 N with 
one of the facets associated with the given local model. In this context, notice that for each subset 
K C {1, 2, . . ., N}, the sets T K)I partition IR^' 11 as / ranges over all choices of / C K. Note also 
that every facet is a cone, since x G J- K}1 implies ax G J- K}1 for all a > 0. See Example 1 below 
for the details of a particular case. 

For a Markov process with state space S C /Z N , we let r(x, v) > denote the jump intensity 
from x G S to x + v G S. We extend r(-, •) to S X Z5 N by letting r(x, v ) = for x G S, x + v G Z N \S. 

The following condition on a state space S and on an intensity function r will be needed for 
each local model in order to obtain large deviation properties of the full model. Thus when the 
condition is assumed S and r will vary with the particular local model. However, when parts 2 and 
3 of the condition hold for the full model, they automatically hold for all local models as well. 

Condition 1 There exists K C {1, 2, . . . , N} such that the following holds. 

1. S = 7/. Xh . 

2. For all I C K, r(x,v) is independent of x G J- K}1 fl S, 

3. For all I C K and for every x G J- K}1 fl S, the set {v G : r(x, v ) > 0} is finite. 

A consequence of Condition 1 is that r(x,v) is radially homogeneous in x, and it is uniformly 
bounded above and below by positive constants on the set {(x,v) : r(x,v) > 0}. 
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For a fixed K C {1, 2, . . . , N} (or equivalently, for a fixed state space S = Z5+ ,K ), the set of all 
facets T K%1 are indexed by all subsets / C K. The set of all "possible" jump directions for the full 
model (i.e., all vectors v such that the jump rate from x to x + v is strictly positive) will be denoted 
by V . This set will automatically include all possible jump directions of all associated local models. 

Let {X(t),t G [0,oo)} be a jump Markov process on S C Z5 N , with paths in D([0,oo) : S) 
and jump intensities r(-,-), such that S and r satisfy Condition 1. For n G IN let the processes 
{X (t), t G [0, 1]} be defined by 

X n (t) = -X(nt), 
n 

and let the corresponding state spaces be denoted by 

S n = l-x :xeS 
I n 

For fi G IR^, n G IN, y G S n , and e G (0, 1), let 

p n (y;/3,e)=P;( sup \\X n (t) - tfi\\ < e } , 
U[o,i] J 

where Pff denotes probability conditioned on X n (0) = y, and || • || denotes the Euclidean norm on 
IR^. Let also 

g n (y;/3,e)= - -log p n (y; [3, t), 
n 

where — log = 00. 

All of the definitions given above are central to the localization procedure used in [3]. A rough 
explanation of how they are used is the following. In process level large deviations, one can often 
deduce the full LDP (cf. Theorem 3 below) if one knows the asymptotic behavior of the probability 
of staying in a small neighborhood of a given trajectory. An approximation argument can be used 
to show that it suffices to consider only trajectories that are piecewise linear, and one can then 
use the Markov property to simplify even further, and restrict attention to the time intervals on 
which the trajectory has constant velocity. Suppose that the time interval of interest is [a, b]. Now 
it turns out that the large deviations behavior over a short time interval, conditioned on starting 
at a point y, will depend only on the form of the generator near y (see [3]). This implies that 
not all aspects of the full model are relevant when determining the probability that the process 
stays near the trajectory during such a time interval. In fact, if the relative interior of the given 
segment of the trajectory lies in the facet jri 1 ----^}^ Q f the full model, then the asymptotics of 
these probabilities are entirely determined by the behavior of a local model with state space 
If a given segment of a trajectory has velocity fi and if X (respectively, X n ) is the appropriate 
local model (respectively, scaled local model), then asymptotic properties of p n identify the large 
deviation properties of the full model over the given time interval (cf. Theorems 2 and 3 below). 



Example 1 Consider a 3-dimensional model and a trajectory of the form {x = y + fit, t G (0, 1)}. 
If y x = y 3 = 0, fi\ = /?3 = and yi A (?/2 + P2) > 0, then the proper local model corresponds to 
K = {1,3}. See Figure 2. 
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Figure 2: Localization for a three dimensional model 

The four regions of the state space that are relevant are the intersections of ^ 3 with 

At = {(xi, x 2 , x 3 ) : xi = 0, x 2 > 0, x 3 = 0}, 

A 2 = {(xi, x 2 , x 3 ) : xi = 0, x 2 > 0, x 3 > 0}, 

^3 = {(xi, x 2 , x 3 ) : xi > 0, x 2 > 0, a; 3 = 0}, 

A 4 = {(xi, £ 2 , £3) : £1 > 0, x 2 > 0, £ 3 > 0}. 

The trajectory is a subset of A\, but small neighborhoods of the trajectory also intersect the other 
three sets. The sets A\, A4 make up just 4 of the 8 facets of the full 3 dimensional model. All 
other facets are unimportant since they are a positive distance from the trajectory. The localized 
model has state space Z5^ ll ' 3 \ and facets 

B x = {(x Xl x 2 , x 3 ) : x x = 0, x 3 = 0}, 

B 2 = {(xi, x 2 , x 3 ) : x x = 0, x 3 > 0}, 

B 3 = {(xi, x 2 , x 3 ) : x x > 0, x 3 = 0}, 

B 4 = {(xi, x 2 , x 3 ) : xi > 0, x 3 > 0}. 

To analyze the probability that a local model follows a linear trajectory, one can consider a 
representation for the pre-limit objects q n (y; (3, e) in terms of a stochastic control problem (see also 
[4] for a full exposition of this approach). We recall this representation below in Theorem 1. To 
state it we need a few definitions. 

Let n G IN be fixed. A control u n (y,v,t) is a measurable function mapping S n x % N x [0, 1] 
into [0, 00). We will impose the following condition on a control: for all (y, u, £) G S n X ^ w X [0, 1] 

r(y, u ) = implies u n (y, v, t) = 0. (1) 

A Markov process {S n (t),t G [0, 1]} with state space S n and jump intensity u n (y,v,t) from y to 
y -\- v/n at time £ (with y G u G ^ w and £ G [0, 1]) is called a controlled Markov process 
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associated with u n . The generator of the controlled process is given by 

(C n f)(y,t)=n J2 u n (y,v,t)[f(y+v/n)-f(y)], 
ve% N 

where / : S n — > IR is continuous and bounded. The dependence of C n on u n is omitted in our 
notation. The original queueing model corresponds to the particular choice u n (y,v,t) = r(y,v). 

The controls that are considered are not required to be bounded, and in fact it will be necessary 
to consider unbounded controls. Existence of controlled processes is therefore not automatic. Let 
Q denote the set of all bounded functions ip : S n X [0, 1] — > IR for which (p(y,t) is continuously 
differentiate in t. We say that the control u n has an associated controlled process if there exists a 
Markov process {S n (t),t G [0, 1]} on some probability space such that H ra (0) = y with probability 
1 and such that 

<p(B n (t),t) - <p(y, 0) - ( d<P( S 2 S),S) + ^MS n (*), *)) ds (2) 

is a martingale in t for t G [0, 1] for all (f> G Q, and if all processes satisfying these two conditions 
have the same distribution. In particular, every bounded control with a finite number of jump 
directions has an associated controlled process. We refer to a control u n as an admissible control if 
it satisfies (1) and has an associated controlled process. 

For <p G D([0, 1] : S n ), (3 G E w , and e > 0, let the infinite exit cost be defined as 

, . . J oo if there is t G [0, 1] such that \\<f(t) — tf3\\ > e, 

*^M=j otherwise . 

For a G IR, let 

01 \ ■ \ a log a — a+1 if a > 0, 
1 oo otherwise, 

where by convention 01og0 = and 0^(0/0) = 0. Note that I is nonnegative and that it has 
superlinear growth, i.e., 

lim 1(a) la = oo. 
Let Ey denote expectation conditioned on H ra (0) = y. 

Theorem 1 (Dupuis and Ellis [3]) Assume Condition 1. Then for each n G IN, y G S n , fi G IR W , 
and e > 0, one has 



q n (y;p,e) = mfE;{ I ^ r(S n (t),v)£ ( )^ \ ' ) dt + g(S n ;p,e) } , (3) 

where the infimum is taken over all admissible controls u n and associated controlled processes 
We use the term running cost for the integral in (3), namely for 
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and the term expected running cost for its expectation under P™. By the exit cost and the expected 
exit cost we refer to g(S n ; (3, e) and to its expectation under P™, respectively. 

We define also a finite exit cost and attach to it a control problem similar to the one considered 
in Theorem 1, the only difference being in the exit cost. For ip G P([0, f] : S n ), fi G Wl N , e > and 
M > 0, let 

9m{v; /V) =g(v; /V) am, 

and define 

q n M (y; P, 6) = inf P; I f Q r(* n (t),v)l ( ^^^J^ ) dt + </m(3- /3, e) J , (4) 
where the infimum is again taken over all admissible controls u n and associated controlled processes 

The following condition on the model is referred to in [3] as the Communication/Controllability 
Condition. Although a weaker condition is also stated in [3] under which the LDP is proved, the 
condition below suffices to cover the models that we consider. If this condition holds for the full 
model, then it will hold for all local models that are obtained from it as well (under Condition f). 

Condition 2 There exists a number Kq such that for each pair of points x and y in S there exists 
J G IN satisfying J < /i'o||£ — y\\, an d a sequence of points {xo, x\, . . . , xj} in S, for which xq = x, 
xj = y, and r(xj, Xj + i — Xj) > for all j = 0, f , . . . , J — 1 . 

The next theorem asserts that certain large deviation limits exist for a local model with state 
space if one considers a constant velocity trajectory for which the velocity fi lies in T ' . 

Thus if the state space imposes a non-negativity constraint (x, e 8 ) = 0, then (/3, e 8 ) = 0. Note that 
such a trajectory will lie in the closure of all facets of the local model. As shown in [3], these turn 
out to be the only velocities needed in order to define the rate function at the process level. The 
first part of the theorem is taken from [3]. The second part concerns the related finite exit cost 
problem, and its proof is deferred to the Appendix. 

Theorem 2 (Dupuis and Ellis [3]) Let S and r be given and assume Conditions 1 and 2 are 
satisfied. Let K be as in Condition 1, and let fi G T K,K be given. Then there exists a number 
L(P) G [0, oo) such that the following holds. 

1. 

limlimliminf inf q n (y;(3,e) 

£->0 8->0 ra-»oo {yeS n :\\y\\<8} 

= lim lim lim sup sup q n (y;(3,e) 

e->0 5->0 n -»oo {2/eS":||2/||<5} 
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Figure 3: The state spaces S and intensity functions f for different values of I in a two dimensional 
example 



2. 



lim limlimliminf inf (w; /3, e) 

M-)-oo e-)-0 5-)-0 n-^oo {2/e5":||2/||<5} 

= lim lim lim lim sup sup (y; (3 , e) 

M->oo e->0 5->0 n ->oo {2/£5 n :||2/||<5} 

= L{fi). 



Remark: Under the assumptions of Theorem 2, it is proved in [3] that the function L( 
and convex on the linear subspace T K,K . 



is finite 



As discussed previously, the result of Theorem 2 on the local model serves as the basis for the full 
LDP. We next define the rate function for the full LDP in terms of the rate functions associated with 
local models. We therefore return to the full model with state space S = and a given intensity 
function r, and assume for this full model that Conditions 1 and 2 are satisfied (with K in those 
conditions equal to {1, 2, . . . , N}). Consider a point x in M^, and let I = I(x) = {i : (x, e 8 ) = 0}. 
Then the local model associated with such a point has state space S = , and the only facet 

on which the local rate function need be defined is T 1,1 . The correct localized version of the jump 
intensities is defined for any v G % N as follows. If x is in one of the possible facets T IM , M C I of 
the local model, then we associate a point that is in the intersection of this facet with the original 
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state space, but away from all non-negativity constraints save those indexed by /. In other words, 
x G jri 1 '"-!^},^ p| g yy e then set r(x,v) = r(x,v). Loosely speaking, in the manner suggested 
by Example 1 we throw away the parts of the generator associated parts of the domain where 
constraints other than those in / are active, and in a corresponding manner extend the state space. 
A simple two dimensional example is illustrated in Figure 3 for the case of a Jackson network. 
We automatically get that the assumptions of Theorem 2 are satisfied by each possible (S,r), and 
therefore conclude for each local model the existence of a function Lj(-) defined on T 1,1 . We then 
define L(x,f3) for any x G IR+ and (3 G M> N by L(x,f3) = Li{(3) whenever / = I(x) and (3 G J- 1 ' 1 , 
and L(x,(3) = oo otherwise. 

For S = U N or S = M^, let T([0, 1] : S) denote the subset of D([0, 1] : S) of piecewise linear 
functions whose derivative has finitely many discontinuities. For x G IR+ and <j> G T([0, 1] : K+) 
satisfying <^>(0) = x, let 



J x {cp)= / L{ct>{t),(t>{t))dt. 
Jo 

Since <f> G T([0, 1] : we have that <f>(t) is well defined, and moreover on 

the intervals where (f> is constant. Hence L((f>(t) , (f>(t)) < oo for all but finitely many points t, and 
therefore J x {4>) is well defined and finite. For all other x G K W and <j> G -D([0, 1], M w ), we set 
J x (<f>) = oo. 

Let •) denote the Skorokhod metric on L>([0, 1] : IR^), and for tp G D([0, 1] : IR^) and a > 

let 

B a ty, «)={(£ D([0, 1] : U N ) : a((, < a}. 

Finally, the lower semicontinuous regularization of J x is denoted by J x , namely for x G M> N and 
^eD([0,l]:U N ), 

J x (-0) = lim inf inf J y ((/)) . 

yeTR N :\\y-x\\<a <t>eB^(iP,a) 

Theorem 3 (Dupuis and Ellis [3]) Assume Conditions 1 and 2. Let x G IR+ . Then the 
following conclusions hold. 

1. The function J x is nonnegative and lower semicontinuous on D([0, 1] : M w ). Furthermore, 
for any compact set C C M> N and any M G [0, oo), the set 

\J{4,eD([0,l]:U N ):J x ^)<M} 

xEC 

is compact in D([0, 1] : M w ). 

2. For any open set G in D([0, 1] : M> N ) and each x G M> N we have the large deviation lower- 
bound 

lim lim inf inf -loeP^(X n £(?)>- inf JJib). 

8^0 n^co ye s n :\\y-x\\<8 U y ipeG 

3. For any closed set F in D([0, 1] : M> N ) and each x G M> N we have the large deviation upper- 
bound 

lim lim sup sup -logP"(X n G F) < - inf J x (ip). 

8^0 n^-oo y( zs n :\\y-x\\<8 n 
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With the preceding result available, all that remains is to identify the rate function J x . Although 
it can easily be identified in certain special cases (e.g., for stable 2-dimensional models), this 
identification is in general a rather difficult problem. In the next two sections we introduce tools 
that will allow us to precisely characterize L(x,(3), and then apply them in Section 5 to two 
interesting classes of multidimensional models. This will identify the function J x on T([0, 1] : 
As we will see, a necessary condition for these methods to work is that the upper large deviation 
rate function obtained in [6] must equal J x on T([0, 1] : This fact and regularity properties 

of the various rate functions will be used in Section 4 to show that J x takes the expected form 



for all absolutely continuous functions (f>. 
2.2 The Skorokhod Problem 

In this subsection we give the precise definition of the Skorokhod Problem. The regularity properties 
of this problem play a key role in characterizing the limits of the controlled processes H ra , which 
are in turn used to evaluate the limits of the quantities q n that appear in Theorem 2. 

For a closed set G C E w , let 



For each point x on the boundary of G one is given a set d(x) of unit vectors in IR . The 
Skorokhod Map assigns to every path tjj £ Dq([0, 1] : M, N ) a path cf) that starts at <^>(0) = ip(0), 
but is constrained to G, in such a way that whenever it is in the interior of G it is obtained by a 
translation of ip, while on the boundary an additional "force" is used, allowed only in the directions 
defined by d(x), so as to keep the path inside G. The definition of the Skorokhod Problem is stated 
below. For i] £ D G ([0, 1] : B- N ) and t £ [0, 1] we let \i]\(t) denote the total variation of rj on [0,t] 
with respect to the Euclidean norm on M, N . 

Definition 1 Let tjj £ Dq([0, 1] : M, N ) be given. Then (cf), rj) solves the SP for tjj with respect to G 
and d if (f)(0) = ip(0), and if for all t £ [0, 1] one has 



5. There exists a Borel measurable function j : [0, 1] — > IR such that d\i]\-almost everywhere 
one has j(t) £ d((f>(t)), and such that 




DodO, 1] : M N ) = W e D([0, 1] : M N ) : ^(0) £ G}. 



1. 4>(t) = ^{t) + i 1 {t), 

2. <f>(t) £ G, 

3. \ri\(t) < oo, 

4. \ V \(t) = J m l{m £ dG}d\ v \(0) 



V (t)= f j(0)d\r,\(0) 



J[o,t] 
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Note that r] changes only when (f> is on the boundary, and only in the directions d((f>). The function 
T defined by (f> = T(ip), on the domain where there is a unique solution to the SP, is called the 
Skorokhod Map. The functions ip, (f> and r] are referred to as the unconstrained, the constrained 
and the constraining paths, respectively. 

In this paper we shall consider only sets G of the following form: 

G= f] {x e M N : (x, n t ) > 0}, (5) 

8 = 1 

for some finite set of unit vectors {n 4 -, i = 1, 2, . . ., q}. To any vector n 4 - we attach a unit vector d{ 
such that (di, ra 8 ) > 0. For x £ dG let I(x) = {i : (x, ra 8 ) = 0}. For any x £ dG we define 

d(x) = < 7 = ^2 a ^ '■ a * - °> IItII = 1 \ • (6) 

It is not assumed that {n 4 -, i = 1, 2, . . ., q} provides a minimal representation for G, and in fact 
this redundancy is necessary in some cases (including one of our examples) in order to allow for 
the proper full set of directions d(x). 

Both the domain G and the sets of directions d(x) are now entirely defined by {(rii,di) : i = 
1,2,..., q}, and therefore so is the SP. 

Definition 2 A Skorokhod Map T is called regular if it is defined on a subset Q of Dq([0, 1] : M, N ) 
that includes all paths of bounded variation, and if it is Lipschitz continuous in that there is K\ < oo 
such that for all ipi and ip? ?n G 

sup ||r(^)W-r(^)WII < i<i sup WW) - Mt)\\- 

te[o,i] te[o,i] 

A SP is called regular if the corresponding SM is regular. If a SM is regular then there exists 
a unique Lipschitz continuous extension of T to all of Dq([0, 1] : M, N ) [7], and with an abuse of 
notation this extension will also be denoted by T. 



3 Representations for the Local Rate Function 

As discussed in Section 2, local models will be used to calculate the rate function for a full queueing 
model. The local models simplify the problem by eliminating from consideration those parts of the 
process that are not involved in determining the rate function L(x,(3) at a point x. However, in 
all cases (save the local model associated with the interior of the full model) we are still obliged to 
deal with processes whose generator is discontinuous in the state variable. 

As discussed previously, if one wishes to consider the probability that the rescaled process X n 
stays near a trajectory that evolves in an (N — to)— dimensional facet, then one must consider how 
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this likelihood is affected by the form of the generator in each nearby region of constant statistical 
behavior (i.e., each facet whose closure contains the given (N — to) — dimensional facet). Typically, 
there will be 2 m regions of different statistical behavior that will be relevant for such a trajectory. 
A key quantity involved is the asymptotic fraction of time that the controlled process spends in 
each such region, and in general it is very hard to obtain this sort of information. However, when 
certain structural properties are present, one can identify alternative variables that provide all the 
information needed to identify the rate function, and yet which are easily obtained as a function the 
jump rates and jump vectors. These variables in fact arise from alternative representations of the 
local rate function, and the first results of this section will introduce these variables and indicate 
their connection with standard representations of the local rate function. 

We therefore return to the "local model" setting of Theorem 1, and assume that a state space 
S and an intensity function r are given, and that they satisfy Condition 1. The set K appearing 
in Condition 1 will be arbitrary but fixed. According to Section 2.1, the rate function for the 
full model will be determined if we identify, for each such K, the function L((i) that satisfies the 
conclusion of Theorem 2 for all fi G T K,K . As in Section 2.1, to simplify the notation we omit the 
dependence on K from S, L((3), r(x, v), and so on. 

We recall that if x is in the facet T K%1 of the local model, then there are no constraints on X{ if 
i (j£ K , that Xi > if i G K , and that X{ = if in addition i £ I. According to Condition 1, r(x, v) 
is independent of x for all x belonging to a given facet T K%1 , I C K. It is convenient to introduce 
the notation rj tV = r(x,v), where x is any point in T K%1 . Thus rj tV is the jump intensity from any 
location in the facet T K%1 in direction v G ffi N . Unless explicitly noted otherwise, all sums on / in 
this section will be over all subsets of K , and all sums on v will be over V . 

For numbers that we denote by pi, ui tV , r v and c v , we shall consider the following sets of condi- 
tions: 

' Pi>0, 

ur v > 0, y > 



ri, v = => u I:V = 0, 

J2piui )V v = ft. (7a) 

I,v 

{ Pi>0, 

J2iPi = i, 

f v = T,lPl r I,vi (8) 

c v > 0, 

Loosely speaking, these quantities may be interpreted as follows: ui tV will be the controlled jump 
rate in facet T K%1 ] pi represents the asymptotic fraction of time that the process which uses these 
controls spends in this facet, so that equation (7a) implies that the mean velocity of the controlled 
process is (3; r v is the average of the original jump rates in the direction v, with the averaging done 
according to the weights pi; c v is a multiplier that is independent of /, and which determines the 
ration of ui tV /ri tV . Thus the second set of conditions represents a more structured system, since 
we essentially consider only controls with a certain type of independence from /. 
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Typically, there will be N equality constraints on the pi from the velocity equation (7a), and one 
further equality constraint from the fact that the p/'s are a probability vector. On the other hand, 
the number of unknown values of pi can be as large as 2 N . This in part explains the simplicity of 
the stable two dimensional case, which one can in fact reduce to the case N = 1 [11]. In general, 
however, the p/'s are not well defined. 

In spite of this ambiguity, for some models one can use Jensen's inequality to restrict the class 
of controls under consideration, and in this restricted class replace the p/'s by a weighted average, 
as suggested by the middle equality in (8). In such circumstances, the class of controlled models 
will be indexed by the collection of multipliers {c v , v G V}. Thus one can consider these multipliers 
as playing the role formerly played by the ui tV . The significance of the parameters r v will then 
follow from the equality of (9) and (10) below, since the independence of c v from / means that the 
sum on / in (7a) can be taken first, effectively replacing the quantities piri tV (with ri tV known but 
pi unknown) by the quantities r v , and (7a) by the last equality in (8). 

Now all this would be of little consequence if the r v were as poorly defined as the pi, but for 
the models we consider this is not true. In fact, it will turn out that the third and fifth equalities 
in (8) will provide the same number of equality relations as the cardinality of V . 

We begin with a lower bound on the large deviations rate function for the local model, which 
corresponds to a large deviation upper bound. 

Lemma 1 Under Conditions 1 and 2 

L((3) > inf < ^ piri,v£( u l,v/ r l,v) '■ Numbers pi and u^ v satisfying (7) and (7a) > (9) 

U* ' ' ' J 

= inf ^ J^r„£(c.y) : Numbers pi, r v and c v satisfying (8)j- (10) 

Remark: The bound (9), though with rather different notation, has already been established in 
[6]. 

Proof: We consider the following relaxed version of (7a): 

< e. (76) 
It will be shown below that for any n G IN, y G S n , (3 G M> N , 8 > and e > 0, such that ||y|| < 8 < e, 
q n (y;/3,e) > inf < ^ pifi,v^( u l,v/ r l,v) '■ Numbers pi and uj tV satisfying (7) and (76) > . (11) 

J 

Given this inequality, part 1 of Theorem 2 and the lower semicontinuity of £ imply (9). Moreover, 
if the numbers pi, r v and c v satisfy (8) and if ui tV = c v ri tV , then pi and ui tV satisfy (7) and 
(7a). Therefore the right hand side of (9) is less than or equal to the expression in (10). On 



y + J2p iUi > vV ~ P 

I,v 
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the other hand, for any pi and ui tV satisfying (7) and (7a), the numbers pi, r v = J2iPl r l,v an d 
c v = J2i Pl u l,v/ J2i Pl r l,v satisfy (8). Jensen's inequality applied to the convex function £ implies 
that 

^2piri, v e{u ItV /r ItV ) > ^2f v £(c v ), 

I,v v 

and thus (10) follows. We therefore turn to the proof of (11), relying on the representation (3). 

Obviously, the exit cost in (3) may be omitted if we infimize over all controls which make it 
vanish, namely 

g"( y ;/3, e ) = inf^|/ o 1 g^r(S»(t), (^|^) , (12) 

where the infimum is taken over the set of admissible controls u n for which the associated controlled 
processes H ra satisfy 

P;( 5 (H n ;/3,e) = 0) = l. (13) 

We use the following notation: 
p n y {I)=E n y {\{^{t)&T K ^}dt, and u; {I ,v) = ^-E n y f u(~ n (t), v, t)l{B n (t) G T K ^}dt 

J Py \-l ) J 

(14) 

if Py(I) > 0, while Uy(I, v) =0 if Py(I) = 0. Suppose that we rewrite the expected running cost in 
(12) as 

It then follows from Jensen's inequality [applied to the convex function £(■)] that (12) is greater 
than or equal to 

Y,Py(I)riA^(h v)/r IiV ). 

I,v 

Since u n is an admissible control, we know that ri tV = implies U™ (I, v) = 0, and also that for 
any M > and j G {1, 2, . . . , N} the process defined in (2) is a martingale if (p(x, t) = (x, ej) A M. 
Taking M — > oo, one obtains by the monotone convergence theorem that 

E;~ n (l) = y + E; f u n (S n (t),v, t)v=y + Y. PyW{I, v)v. 

The constraint (13) implies that pi = p'y(I) and ui tV = U'y(I,v) satisfy (7) and (76). This proves 

(11), and completes the proof of the lemma. 

□ 

We next derive an upper bound on the rate function L((3). The proof of the lower bound just 
given suggests that it may be sufficient to consider controls that are constant in the time variable. 
If such a control is used, and if in addition it is of the special form associated with the set of 
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constraints (8) (i.e., u n (x,v,t) = c v rj tV for x £ T K%1 , I C K, v £ 77. x . and t £ [0,1]), then the 
definition (4) of implies that 

?Af(i/;M < YiPvVMicv) +MP;(sup||H n (i) - (3t\\ > e) (15) 

for any such admissible control u n . 

We introduce two conditions that will be needed in the treatment of the upper bound. Both 
parts refer to (8). The first states that the collection {r v }, when viewed as a function of the {c v } 
and (3 £ T K,K , is unique (if it exists) and continuous. When this property holds it identifies the 
r v as a quantity that is easier to work with (when compared to the p/'s). The second part of the 
condition is less significant, and simply requires that for any solution c v > 0, v £ V, there exist a 
nearby solution with c v > 0, v £ V. 

Condition 3 We consider S and r satisfying Condition 1 and let K be such that S = 77.^ ix . 

1. Letc v > 0, v £ V be given. Then for all e > there exists 6 > such that if ({pi}icl< > \fv\vev-, 
{c v } veV ,(3) and ({pj}/ C A', {r' v }vev, {c v }vev, ft) satisfy (8), where (3,(3' £ T K ' K and ||/3 - 
(3'\\ < 8, then \r v — r' v \ < e, v £ V . 

2. Let the quadruple ({pi}icK, \fv\veVi { c v}veV, ft), where c v > 0, v £ V and (3 £ T K,K , satisfy 

(8). Then there exists an M < oo such that for all e > there exists ({p'j}icKi {r' v }vev, { c ' v }veV, ft) 
satisfying (8), such that 

• c' v > 0, v £ V, ft £ T K ' K , \r v - r' v \ < e, v £ V, and \\(3 - ft\\ < e, 

• for v £ {w £ V : r w / 0} \c v — c' v \ < e, and for v £ {w £ V : r w = 0} c' v < M. 

Note that part 1 of Condition 3 implies uniqueness of the numbers {r v } given (3 and c v > 0. However, 
existence of r v is not assumed. Also, we reiterate that it may happen (and indeed will happen in 
our applications) that there is uniqueness of the {r v }, without uniqueness of the corresponding 
{pi} appearing in (8). 

Our proof of the lower bound is based on weak convergence of the trajectories of the controlled 
processes to the solution of a related SP. The following condition asserts that this SP is well behaved. 
The first part simply asserts that the domain of the SP and that of the large deviation problem are 
compatible, while the second states the nonnegativity relation between normals and directions of 
constraint that is needed in the SP. To identify the LLN limit of the controlled queueing network 
as the solution of a SP, we will represent the system as an average "drift" perturbed by correction 
terms. The fourth part ensures that these correction terms point in a direction that is consistent 
with the given SP. The key part is 3, which will identify the solution to the SP as the unique LLN 
limit of the controlled queueing system. 

Condition 4 Consider S and r satisfying Condition 1 and let K be such that S = 7/.^.' ix . Then 
for every set of numbers c v > 0, v £ V , for which (8) holds with some (3 £ T K,K , {pi} and {r v }, 
there exist a number q £ IN, and unit vectors {n 4 -, i = 1, 2, . . . , q} and {di, i = 1, 2, . . . , q} (that may 
depend on {c v }), such that the following hold. 
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1. IR 



N,K 



f]tii x e B. N ■ (x,n t ) > 0}. 



2. (di,ni) > for i= 1,2,..., q. 

3. The SM associated with {(rii,di),i = 1, 2, . . . , q} is regular in the sense of Section 2.2. 

4. For any I C K, 7/0, ifJ2 v c v(ri, v ~ rq, v )v / then 



J2 V c v (ri,v ~ r% v )v 



G d(i 



II J2v Cv{ri, v ~ r$, v )v\ 
where d(-) is defined by (6) with the d{ as given above. 



e T 



K,I 



Note that we do not assume the representation in part 1 is a minimal representation for M + 
We have the following. 



N,K 



7 N,K 



Proposition 1 Consider S and r satisfying Conditions 1 and 4, and let K be such that S = % + 
Let c v > 0, v G V and fi G fr e gi ven> an d assume (8) is satisfied with some {pi} and {r v }. 

For n G IN let u n be an admissible control defined by u n (x,v,t) = c v ri^ v for x G T K%1 , I C K, 
v G V and t G [0, 1]. Then for all e > 0, 



0. 



(16) 



Note that according to our notation P ( 5 l (H ra (0) = 0) = 1. 

The proof of this proposition is given in Section 4. Let us show that an upper bound on the 
rate function follows, under all of Conditions 1, 2, 3 and 4. 

Let fi G j^l^A k e fixed. Let c v > 0, v G V be such that there exist {pi} and {r v } for which 
(8) holds with the above (3. Note that the {r v } are unique, by part 1 of Condition 3. Furthermore, 
recall the notation in (14), and observe that in terms of this notation 



Ka n (i) 



c v rr v v. 



I,v 



It is easy to verify the bound sup n Eq ||H ra (l)|| 2 < 00. Therefore by (16) 



lim sup 



^2po{I)c v r ItV v - [i 



I,v 



< e. 



Part 1 of Condition 3 then implies that for all v G V 



lim sup 



/here a f — > as e — > 0. By part 2 of Theorem 2 and (15) we obtain 

L(B) < lim inf lim inf lim inf Om(0; (3, e) < f v £(c 
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One may now take infimum over all c v > 0, v G V, {pi}, I C K and {r v }, v G V for which (8) 
holds. The bound that one obtains still differs from the expression in (10), where all the numbers 
{c v } are allowed to be zero. We now show that the two infima are the same. Suppose that we 
are given quantities that satisfy the conditions (8), with c v > 0, v G V. For e > let the primed 
versions be associated as in part 2 of Condition 3. Hence c' v > 0, v G V, and using the inequality 
just proved and the bounds \r v — r' v \ < e, c' v < M, v G {w G V : f w = 0}, we obtain 

L(P) < L(f3)-L(f3')+J2r'Ac'v) < L(P)-L((3')+ £ r'J(c' v ) +#{v G V : r v = 0}e[lV£(M)]. 

We also have that fi' G T K > K , \c v - c' v \ < e for v G {w G F : f w / 0}, - < e and ||/3 - /3'|| < e. 
Recall that i(a) is continuous for a G K+, and L(/3) is lower semi-continuous for fi G T K,K . Sending 
e — t- we obtain the desired inequality with the unprimed quantities. When combined with Lemma 
1 for the lower bound we obtain the following, which is the main result of this paper. 

Theorem 4 Let S and r be fixed, assume Conditions 1, 2, 3 and J h and let K be such that 
S = 7/.^' ix . Then the rate junction for the local model is given by 

L{f3) = inf -|^?\/(c^) : Numbers pi, r v and c v satisfying (8) j- . (17) 

As discussed at the end of Section 2.1, the rate function for the full model (on path space) is 
obtained as follows. We assume Conditions 1, 2, 3 and 4 for the full model. For each point x G 
we let / = I(x) = {i : (a;,e 8 ) = 0}, and then associate to this point the proper local model. As 
we will see, Conditions 1, 2, 3 and 4 then hold for the local model, and Theorem 4 identifies the 
function L(x,/3) as Lj(/3) for fi G J- 1,1 . We let L(x,/3) = oo in all other cases. By Theorem 3 the 
large deviation principle for the sequence {X n ,n G IN} holds with the rate function J x ((f>), which 
is the lower semicontinuous regularization of the function J x defined on T([0, 1] : ) by 

j x (cp)= C L(cp(t)A(t))dt 
Jo 

if (f>{0) = x, and oo otherwise. 

The following theorem provides an explicit formula for J x . 

Theorem 5 Assume Conditions 1, 2, 3 and J h and define L(x,(3) and J x {4>) as in the last para- 
graph. Then the following representation holds: 

J x (cp) = C L(cp(t)^(t))dt 
Jo 

if 4>(0) = x and if cf) G D([0, 1] : H+) is absolutely continuous, while J x (4>) = oo in all other cases. 

Proof: The function J x (4>) coincides on T([0, 1] : IR+) with the upper large deviation rate function 
obtained in [6]. In the notation of this paper, the upper rate function of [6] is just 

j x (cp) = ^ L(cp(t)^(t))dt 
Jo 
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if ^>(0) = x and if <f> G -D([0, 1] : H + ) is absolutely continuous, and J x {4>) = oo otherwise. Since it 
is proved in [6] that 

[<t> G D([0, 1] : K+) : ^ (o) (0) < M, <^>(0) G c} 

is compact for any compact set C C K W and M < oo, is lower semicontinuous. Since J x is 
the lower semicontinuous regularization of a function that agrees with J x on a dense subset of 
its domain of definition, j x (<f>) < J x (cf>) for all D([0, 1] : 1R+). To prove the reverse inequality, 
we must show that given cf) £ D([0, 1] : K+) and e > there is cf) e G T([0, 1] : K+) such that 

Jx(<f>e) < Jx(<f>) + 6 an( l su PtG[o,i] ll^(^) — ^ 6 - However, the construction of such a function 

can be carried out using an argument similar to the one used to prove Lemma 7.5.4 of [4], and is 
omitted. 
□ 

In Section 5 we will verify all the assumptions of Theorem 4 for some interesting models. The 
following monotonicity result of the regularity of the SM shows that if 

part 4 of Condition 4 holds for the full model, then it holds for all corresponding local models 
as well (for a different version see [9]). The proof is given in the Appendix. It is easy to check that 
all remaining parts of this condition also hold once they are verified for the full model. 

Lemma 2 Consider a SM on G C K W associated with {(n 4 -, di), i = 1, . . . , q}, where (in, di) > 
and G is as in (5), and assume that the SM is regular. Let x G dG be fixed, and let I(x) = {i : 
(m, x) = 0}. Then the SM associated with {(n 4 -, di), i G !(%)} is also regular. 

4 Weak convergence considerations 

This section contains the proof of Proposition 1, which is based on showing that {H ra } converges 
in distribution to the solution of the SP defined in Condition 4. We use several ideas from [7], and 
in particular, the proof of Lemma 5 closely follows that of [7, Theorem 3.2]. 

We assume that a state space S and an intensity function r satisfying Condition 1 are given. 
Moreover, letting K be as in Condition 1, we are given also [i G T K ' K . On S n , n G IN we consider 
controls of the form u n (x,v) = ui tV , x £ I , I C K and v G V, and jump Markov processes 
S n (t), t G [0,1] starting at zero with probability 1, and with jump intensity u n (x,v) from x to 
x + v. As in Section 3, the sum on / will always be over subsets of K, and the sum on v will be over 
V. We assume that for some c v > 0, v G V, the controlled jump rates are given by ui tV = c v ri tV . 
We recall that indexes the interior facet relative to the local model, i.e., T K ^. Further, we are 
given a SP {(n 4 -, di), i = 1, . . ., q} that satisfies Condition 4. 

The proof will make use of a jump Markov process (X n (t), Y n (t)) on S n X {n~ x Z N ). The 
process will start at (0,0) with probability 1. The Y n component will be a homogeneous jump 
Markov process on ra _1 Z5 N , whose generator will always be the same as that of H ra for points in 
the interior T K ^, while X n will be equal in law to H ra . Moreover, X n and Y n will have identical 
increments when X n (t) G T K ^ , and independent increments when X n (t) (j£ T K ^ . For such a pair 
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process the jump intensities are as follows. Let x' = x + v. Then the jump intensity M ra (x, v) from 
x = (x,y) to x' is given by: 

if either x £ an d x ' = (x + n~ 1 v, y + n~ 1 v ) 

or x (jt J-^' v and x = (x, y + n v) 



u x, v = < 



nUl jV if X £ .F^'^ ^ jrK,t> an( -| X ' _ ^ _|_ n 1^ 

, otherwise. 



We define Z n (t) = X n (t)-Y n (t), t £ [0, 1] and let Z n = Z n + Z n be the Doob-Meyer decompo- 
sition of Z n , where Z n is a process that is predictable on the filtration generated by (X n ,Y n ), and 
Z n is a martingale on the same filtration. If one considers the process Y n as an unconstrained ver- 
sion of the controlled queueing system, and X n as the "correct" constrained version (corresponding 
to ip and (f> respectively in the formulation of the SP in Section 2), then Z n (and more precisely 
Z n ) will play the role of the constraining term r] in the SP. This correspondence will turn out to 
be exact in the limit n — > oo. 

To prove this fact, we define 

7 W = \ 

[ otherwise 

for t £ [0, 1], and let R n (t) = \Z n \(t). Denoting the closed unit sphere in IR^ by B(0, 1), we let 

S' = IR^ A " X 5(0,1), 

and define measures /i" on E = [0, 1] X £' by 



fi n ([0,t] x A) = l{(X n (0),r(0)) £ A}dR n {9). 
J[o,t] 

Proposition 1 will turn out to be a consequence of the following three lemmas. For the notation 
used in the statements we refer to Condition 4. 

Lemma 3 Let 

V 

and for t £ [0,1] let 

xj){t) = (3 t, 4>{t) = fit, and r](t) = {(3- (3 )t. 
Then ((f), rj) solve the SP for if with respect to M,^ ,K and {(n 4 -, di), i = 1, q}. 

Lemma 4 The family {(X n , Y n , Z n , Z n , R n , ^i n ),ne IN} is tight. 
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Lemma 5 Consider any subsequence of 

(X n ,Y n ,Z n ,Z n ,R n ,fi n ), 

and let 

(X,Y,Z, Z,R,fi) 

denote the limit of a weakly convergent sub sub sequence. Then w.p.l (X , Z) solves the SP for Y 
with respect to WL^ ,K and {(n 4 -, df), i = 1, q}. 

Before giving the proofs of these lemmas, we show that Proposition 1 follows. It follows from 
the standard functional law of large numbers that the weak limit of Y n is exactly ip of Lemma 
3. Note that regularity of the SM implies uniqueness. Thus by Lemmas 4 and 5 and the usual 
argument by contradiction, the weak limit of X n is just (f>(t) = (it, t G [0, 1], w.p.l. This implies 
(16), and Proposition 1 follows. 
□ 

Proof of Lemma 3: Properties 1 and 3 of Definition 1 are obvious. Since Proposition 1 assumes 
fi G J rK ' K , it follows that <f>(t) G F K < K for all t G [0,1]. Since F K < K is a subset of <9IR+' A ", 
properties 2 and 4 follow. Next, if /3 = /3 then \i]\(t) = for t G [0, 1], and property 5 obviously 
holds. If fi / fi 0l then let j(t) = (fi - fh)/\\fi - /3 || for t G [0, 1]. It immediately follows that 
V(t) = lo md\v\(0). Since by (8) 

I,v 

it follows that j(t) G d(x) if x G T K ' K by part 4 of Condition 4. Since (f(t) G T K ' K for all t G [0, 1], 

property 5 of Definition 1 follows as well. 

□ 

Proof of Lemma 4: That the family {(X n ,Y n ) , n G IN} is tight in the Skorokhod topology 
follows immediately from Aldous-Kurtz Theorem (see, e.g., [12]). Hence {Z n } is also tight. For 
the predictable part of Z n we have the following expression: 

Z n (t) = f J2(u(X n (0),v) - u^ v )vd9. (18) 
Jo v 

In particular, the trajectories of Z n are Lipschitz with a common coefficient, and hence {Z n } is 
tight. Similarly, {R n } is tight, and in fact there is B < oo such that R n (l) < B for all n G Zu 
w.p.l. 

Thus all that remains is to show the tightness of as random variables in a space of measures 
with the weak topology. Let C' e C -D([0, 1] : IR^' 11 ) be compact and such that 

pn £ C e } > 1 — 6 

for all e G (0, 1) and all n G IN. Then there is M e < oo such that for any X n G C e 

\\X n (t)\\ < M t < oo (19) 
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for all t £ [0,1]. We recall the bound R n (l) < B < oo, which implies /z n (E) = R n (l) < B. If 
/2 n (T<) / then ^"/^"(E) is a probability measure on S. Using (19), for all e £ (0, 1) there exists 
a compact set C' C S such that for n£lN 

P"{^(S)/0and^|p> e }< e . 

Hence 

P n / and -^y £ C^j > 1 - 25, (20) 

where 

C? = n {veVV):v((C' 2 - m y)<2- m }. 

m£]N:2-'»<8 

For each 8 £ (0, 1) the set of probability measures C" is tight, and therefore by Prohorov's Theorem 
it is also relatively compact. To complete the proof of tightness of the random measures /2 n we 
must also show that for all e > there exists M such that P{pi n {T 1 ) > M) < e. Since this is implied 
by the w.p.l bound /U ra (S) < B, the proof of the lemma is complete. 
□ 

Proof of Lemma 5: According to the Skorokhod Representation Theorem, there exists a probabil- 
ity space on which there are random variables having distributions identical to (X n , Y n , Z n , Z n , R n , /2 n ) 
and (X, Y, Z, Z, R, //), and for which the convergence is in the a.s. sense. In the proof of the lemma 
we will make use of this alternative space, but without changing the notation (see Shiryaev [16]). 

In the proof to follow there will be numerous properties of and relations between the limit and 
prelimit random variables that hold only in a w.p.l sense. To simplify the discussion, the w.p.l 
qualifier will be omitted. Statements that hold only in an a.e. sense for the time variable will be 
explicitly identified, in which case the qualifier also holds w.p.l. 

Suppose that the quadratic variation up to time t of the martingale Z n (t) is denoted by (Z n )(t). 
Then by the Burkholder-Davis-Gundy inequality, there exists a constant c < oo such that 

E n sup \\Z n (t)\\ 2 < cE(Z n )(l). 
te[o,i] 

Since the jump rates and jump vectors are all uniformly bounded, there is c\ < oo such that 
that E(Z n )(l) < ci/n. Therefore Z n converges weakly to zero in the sup norm, and consequently 
Z = Z. 

We next observe that X n (t) = Y n (t) + Z n (t) and also that X n (t) £ M+' A " for all n £ IN and 
t £ [0, 1]. The almost sure convergence implies the analogous equality and inclusion for the limit, 
and therefore properties 1 and 2 of Definition 1 follow. Recall from the proof of Lemma 4 that 
there is B < oo such that R n (l) < B, which implies R(l) < B. Since the total variation of an 
element of D([Q, 1] : S) (for any Polish space S) is a lower semi-continuous functional, we also have 
|^|(1) < oo, and property 3 follows. 

To prove properties 4 and 5 we first define the sets 

Si = {(t,x,j) £ S : x £ T K >®}, 
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S 2 = {(t lXll ) £ S : 7 £ d(a;)} 

and 

E 3 A = {(i,x,7)eS:|x-I(f)|>A}. 

Note that for i£ [0,1], 

= / jfi n (de,dx,dj). 

J[0,t]xE' 

Since the trajectories of Z n are Lipschitz continuous with a Lipschitz constant that is independent 
of n and a;, the limit Z is also Lipschitz continuous. Since fJ,({t} X £') = (a.e. in £), the weak 
convergence /i n — > fj, and the last display imply 

Z(t) = ( jfj.(d0,dx,dj) (21) 

J[0,t]xE' 

for all t £ [0, 1]. It follows from (18) that 

/ l{X n (t) £ T K ^}dR n {t) = 0. 
i[o,i] 

Thus /2 n (T<i) = for all n £ IN, and since Si is open relative to S the weak convergence also implies 

M^i) = 0. (22) 

Now by part 4 of Condition 4, j n (t) £ rf(X n (i))U{0} for all t £ [0, 1], and by (18) (dR n /dt)(t) = 
whenever 7™(i) = 0. Thus 

/ l{ 7 n (t) ?d(X n (t))}dR n (t) = 0, 
J[o,i] 

which is the same as saying ^"(X^) = 0. Since S 2 is open relative to S, the weak convergence also 
gives ^(£2) = 0. 

To finish the proof of properties 4 and 5 of the SP, we use the uniform convergence X n — > X , 
which for fixed A > implies that ii n {Y,^) = for all sufficiently large n. Again using the weak 
convergence, /i(S^) = for all A > 0. Sending A — > 0, it follows that fi is supported on S(0, 1), 
where 

s(o,t)= U ^(0), 

ee[o,t] 

and 

S(t)={(t,a;,7) £ S : a; & F K ' 9 ,y £ d(a;),a; = X(i)}. 
From equation (21) we get 

Z(t) = jfi(d6,dx,dj) 
is(o,t) 

for £ £ [0,1]. Let X(t) = fj,{[0,t] X £'). Then there exists a measurable mapping £ £ [0,1] 1— > 
fj,(dx, dj\t) £ A4(£'), such that for each Borel set A, 

(i([0,t]xA)= / / fj,(dx,dj\0)\(d0). 

J[0,t] J A 
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It follows that 



where 



Z(i)= / C(0)A(d0), 

J[o,t] 



((t) = y[j,(dx,dy\t). 

Writing ((t) = a(t)j(t), where a(t) = \\((t)\\ and j(t) = ((t)/a(t) if a(t) > and j(t) = otherwise, 
we have that j(t) G d(X(t)) A-a.e. Hence 



and consequently, 

Z(t) = f l{9)\Z\{d9). (24) 
J[o,t] 

It follows from (23) that \Z\ is absolutely continuous with respect to A. Hence property 4 of the 
SP is implied by (22), and likewise property 5 is implied by (24). 



\Z\{t) = / a(0)X(d0), (23) 
J[o,t] 



5 Examples 

In this section we verify all the conditions required by Theorem 4 for two interesting examples: a 
processor sharing model and the classical Jackson network. For both these systems the full model 
and all localized models will take the form assumed in Sections 2 to 4. For each example we must 
carry out the following tasks. 

• Verify that the jump rates of the original model are constant in each facet. 

• Check the communication condition (Condition 2). 

• Verify the required uniqueness and perturbation properties of solutions to the system (8). 

• Check that all corresponding SPs are regular whenever c v > for all v G V. 

5.1 The processor sharing model 

Consider a queueing system that consists of a server and N classes of customers that can be served, 
one at a time. There is one queue for each class, and the customers arrive and enter the queues 
according to their class. A column vector / = (/i, . . . , /jv) t satisfying J2i fi = 1 is given, where the 
number fi > represents the minimal fraction of the overall service capacity guaranteed to class i 
(see Figure 4). The probabilistic model is as follows. The arrivals are modeled as N independent 
Poisson processes with rates a 4 - > 0, i = 1, . . ., N. The service times of customers of class i are 
exponential random variables with parameter > 0, independent of each other, of the service 
times of other classes, and of the arrival processes. When the server is free, service will be offered 
to a customer in one of the non-empty queues, say queue i, chosen at random, with probability 
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Figure 4: Processor sharing model 



proportional to fi<Ji, independently of the other choices, service times and arrivals. If all queues 
are empty then no service takes place. 

The vector defined by the number of customers of each class that are in the system, including 
those in the queues and the one being served, is a jump Markov process on . Its jump intensity 
from x £ 7/. X to x + e t is a t for i = 1, . . . , A. Moreover, if x £ 7/. X n jr{i,-,JV},-f anc i / ^ | 1) ^ 
then for i £ I the intensity of the jump from x to x — e 4 - is (Jifi/fic, where fjc = J2iei c fi an d 
I c = {1, N}\I. Thus the relative fraction of service offered to all nonempty queues is independent 
of the state of the system. Other models, such as models with modulated arrival rates and discrete 
time models could also be considered. However, the main point of the analysis is to show how one 
can deal with discontinuities in the statistical behavior in this multidimensional setting. 

As noted in the last paragraph, the set V of allowed jump directions is {±e 4 -, i = 1, . . . , A}. In 
addition, the intensity function ri tV is given by r^ et = a 4 - for all facets / and all i = 1, . . ., A, and 
rj _ e . = Gifi/ fjc if i (j£ I and ri- £t = otherwise. Thus Condition 1 holds. 

We next consider the most involved condition, which is Condition 4 on the regularity of the 
associated SPs. This requires that we first identify the controlled processes that must be considered. 
Recall that these processes will have jump rates that are perturbed versions of the original jump 
rates. Given a solution of the system (8), the new jump rates will take the form ui tV = c v ri tV . In 
particular, for any v ui tV /ri tV is independent of the facet /. We will show that Condition 4 holds 
for K = {1, . . .,N}. As noted before Lemma 2, this implies that the condition also holds for all 
K C {1, . . . , A}, i.e., all local models. 

To simplify the notation we will write cf = c± ei , i = 1,...,A. The vectors that appear in 
Condition 4 can be calculated as follows. If / / {1, A}, then 

j N 

v iei c *'- fc «=i 

For facets j^{h-,N},l of 

co-dimension 1, namely for / = {j}, it is easy to verify that the last display 
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reduces to 

Y^ c v( r i,v - r$ )V )v = _ J CA(ej - /), 

v J 3 

where C = diag(c~ : i = 1, . . .,N) and A = diag(<7 8 ' : i = 1, . . .,N). We define 



d 



_■ CA( ej -/) 

J IICAfo-/)!! 



for j = 1, . . . , N. If rij = ej, then obviously (dj, rij) > 0, j = 1, . . . , N. We next examine facets 
co-dimension higher than 1, (i.e., we let / consist of more than 1 element), but still 
assume / / {1, N}. In this case 

J2 c v( r i,v - r <n,v) v = J2j^ CA ( et - f) = J2 atd ^ 

y iei ^ iei 

where oi{ = ||CA(e 4 - — f)\\fi/fi<^- Hence if the expression in the latter display is non-zero then, when 
normalized, it belongs to d(x) for any point x in the facet J^i 1 '---^}' 1 . 

Lastly, we must consider the case / = {1, ...,N}, which corresponds to the facet {0}. For all 
i = 1, . . . , N ri-e t = 0, and hence one constraint direction at the origin should be 

N 

c v( r i,v - r <n,v) v = c i a *f* e * = CA f- ( 25 ) 

v i=l 

Note that the latter cannot be obtained by a linear combination of di, i = 1, N, since for all i 

(CA(e 8 -/),C- 1 A- 1 (l,...,l) T ) = 0, 

while 

(CA/,C- 1 A- 1 (1,...,1) T > = 1. 

Following the treatment in [9], we supplement the set of normals rii = ei, i = 1, . . . , N with 

n JV+1 = CA(l,...,l) T /||CA(l,...,l) T ||, 

thus introducing the extra constraint {x £ M, N : (x, rajy+i) > 0}. Note that the domain is still 
given by n^ 1 -^ £ 1R N : (x,rii) > 0}. By also setting d^+i = n-N+i, it is now possible to obtain 
the direction (25) as a linear combination of di, i = 1, . . . , N + 1 with nonnegative coefficients. We 
have therefore shown that parts 1, 2 and 4 of Condition 4 hold with q = N + 1. To complete the 
verification of Condition 4, all that remains is to check the regularity of the SP. Regularity for the 
case where CA is the identity matrix is proved in [9]. Extending the proof to general CA is easy 
since this SP can be obtained from the case of the identity matrix by a diagonal change of variable. 



Theorem 6 (Dupuis and Ramanan [9]) Consider the SP associated with {(n 4 -, di), i = 1, . . . , 
1}, where for i = 1, . . . , N, n t = e i; and d l = (e l - f)/\\e l - f\\, and n N+l = d N+1 = J2tLi e t /VN. 
Then the corresponding SM is regular. 
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Now let hi > 0, i = 1, . . . , N be fixed, and denote B = diag(6 8 - : i = 1, . . . , N). Then we have 
the following. 

Corollary 1 Consider the SP associated with {(hi, di) , i = 1, . . . , N+l}, where for i = 1, . . . , N+l, 
hi = Bni/\\Brii\\ and di = Bdi/\\Bdi\\, and ni and di are as in Theorem 6. Then the corresponding 
SM is regular. 

Proof: Note first that for i = 1, . . . , N, hi = e 4 -, and rajv+i = B(l, . . ., 1) T /||_B(1, . . . , 1) T ||. There- 
fore the definition of hi is consistent with G, which is still given by n^ 1 -^ £ IR^ : (x,hi) > 0}. 
We show below that for any ip £ Dq([0, 1] : IR^) which is of bounded variation, ((f), i]) solves the SP 
for ip with respect to G and d if and only if (Bcp, Br]) solves the SP for Bip with respect to G and 
d. On the subset of Dq([0, 1] : IR^) of paths of bounded variation, this would imply uniqueness 
and Lipschitz continuity of the SM associated with G and d, based on these properties for the one 
associated with G and d. The extension to L>g([0, 1] : K W ) is as in [9]. 

Let us then assume that (<p, r]) solves the SP for ip with respect to G and d. It remains to 
check properties 1-5 of Definition 1 for (Bcj), Br]) and Bip with respect to G and d. Properties 1-3 
are immediate. Note that (f)(9) £ dG if and only if B(f>(0) £ dG, and that \(Br],ei)\ is absolutely 
continuous with respect to \tj\. Since we have that J| 1 j l{(f>(0) (j£ dG}d\r]\(0) = 0, it follows that 
J [ol] l{B(f)(0) £ dG}d\{Bi 1 ,e l )\(0) = 0, i = 1,...,N, and hence f [ol] l{Bcf)(0) £ dG}d\Br]\(0) = 0. 
Thus property 4 follows. 

Let 7(-) be as in property 5 of Definition 1, corresponding to (ip, <p, rf). Define j(t) = Bj(t) /\\Bj(t)\\. 
Since d(cp(t)) = d(B(p(t)), t £ [0,1], the definition of d implies that j(t) £ d(B(p(t)) whenever 
j(t) £ d((p(t)). Since T](t) = Jj t ^ j(0)d\i]\(0), we have 

\B V \(t)= f \\B 7 (9)\\d\vm- 
J[o,t] 

Thus Br](t) = Jj t ^(Bj(0) /\\Bj(0) \\)d\Br]\ (0) , and property 5 follows. The reverse direction is 
obtained similarly, using the transformation B~ l instead of B. The proof of the corollary is 
therefore complete. 
□ 

In view of the corollary, Condition 4 is verified, and we now turn Condition 2. It suffices to 
verify this condition for just the full model. We first identify for each x £ those directions v 
for which r(x, v) > 0. Since a 8 - > for i = 1, . . . , N, we have that r(x, e 8 ) > 0, i = 1, . . . , N. Recall 
that I(x) is defined by I(x) = {i £ {1, N} : (x, e 8 ) = 0}. Since o~ifi > 0, i = 1, . . . , N, we have 
that r(x, -ei) > for alH £ {1, . . . , N} \ I(x). It follows that 

r(x, v ) > for all x £ 7/.^ and v £ V for which x + v £ 7/.^ . 

In other words, the set of possible jumps is always V , unless such a jump takes the process out 
of 77.^ . Suppose we are given points x,y £ 77.^ . Then it is easy to construct a sequence of 
points {a;o, x\, . . . , xj} with xq = x, xj = y, X{ — Xi_\ £ V and X{ £ 77^ for all i = 1, . . . , J, 
where J = X)fcLi \{lji e k) — { x , e k)\- By the last display, we have that for any such sequence, 
r(xi-\, Xi — Xi-i) > for all i = 1, . . . , J, and Condition 2 follows. 
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We next verify Condition 3. This condition must be verified for each local model, and so we fix 
K C {!,..., N}. Suppose the quadruple ({pi}icK, {r v }vev, {c v }vev, ft) satisfies (8), where c v > 0, 
v £ V, and [3 G J- K,K . Since r/ jej = a 4 -, / C K, i = 1, . . . , N, it follows from (8) that f ei = a 4 -, 
i = 1, . . . , N. Let us denote r 4 - = f_ ej . Then for i = 1, . . . , N the last line of (8) implies 

c t a i ~ c 7 T i = (P, e i), ( 26 ) 

and hence all {r v } ve y are uniquely determined. Similarly, if ({p'i}icK,{r' v }v£V,{ c v}v£V, fl') sat- 
isfies (8), and (3 1 G T K,K , then as before f' e . = a 4 -, and the last display together with its primed 
analogue imply c~ (t- — Ti) = ((3 — /3',e 8 ). Hence \r' v — r v \ < ||/3 — (3'\\/ min 8 - c~, d£F, and part 1 
of Condition 3 follows. 

As for part 2 of Condition 3, suppose that the quadruple ({pi}icK, \fv\vev-, { c v}veV, ft) satisfies 
(8), where (3 G T K,K , but we only know that c v > 0, v G V. If p® > 0, we set p\ = pi, I C K. 
Otherwise, for some small k > we set p'^ = k, and pj* = pi* — k for some /* for which pi* > k. 
For all / (j£ {0, /*} we take p\ = pj. Since r0 v > 0, u G V, it follows that = J2i P'i r l,v > for all 
v £ V . For i = 1, . . . , N , we next define c~' = c~ + k and 




where as before r 4 - denotes r'_ e . > 0. With these definitions we have cf'cii — c- 't[ = ((3' , e 8 ), where 
P' = P + J2 t ^i<( Ka t - KT [) e i ^ -^ A ' A • Hence the quadruple ({/Oj}/ C A', {^}„ e y , i c 'v}vev, /?') satisfies 
(8). Since t[ > we have cf 1 > 0. Finally, we observe that if k — > then p\ — > pi, f' v — > f v , 
c' v —7- c v , and [3' — > (3. This proves that a slightly stronger statement than part 2 of Condition 3 
holds, since c' v — > c v for all v G V, and in particular, c' v are bounded for v G {w e V : f w = 0}. 
This completes the verification of Condition 3. With all four conditions verified, the identification 
of the rate function for the processor sharing model is complete. 

We have just shown that all the conditions of Theorem 4 hold for the model of this subsec- 
tion. Let us rephrase the formula (17) in the current setting. The sum J2v ^v^( c v) translates to 
J2iLi[ a i^( c t) + T i^( c i)]- Also, the equation r v = J2i Pl r l,v m (8) translates as follows: r ei = a 4 -, 
and 

Piri-e, = 2^ pI ~f~' 

I ICK:ieI c •' /C 

where fjc = J2jel c fj- The special form that Theorem 4 takes in this setting is therefore stated as 
follows. 



Theorem 7 Let a 4 - > 0, fi > and 0{ > 0, i = 1, . . ., N be given, and consider the associated 
processor sharing model, as described in this subsection. Then the rate function for the local model 
corresponding to K C {1, . . . , N} takes the following form, for fi G T K,K : 

N cfa l - c~T t = ((3,e 

L(f3) = inf ! J2 [ a A c t) + r Ac~)] : n = <Tifi E/cA':*e/ c Pilfi 



i=i pi > 0,12^1 = ^,4 >o 



where = T,j£ifj- 
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Figure 5: Jackson network 



5.2 The Jackson network 

Consider a queueing system in which customers of one class occupy N nodes, where each node 
consists of a queue and a server. Customers arrive at a given queue from either other nodes or 
outside the system, and after being served, they may move to one of the N queues (including the 
queue at the current node), or exit the system. The statistics of arrival, service time and routing 
variables depend on the node to which they correspond (see Figure 5). In particular, the arrivals 
are modeled as independent Poisson processes with rates a 8 - > 0, i = 1, . . . , N, where a 8 - > for at 
least one i = 1, . . ., N. Service times are independent of each other and of the arrival processes, 
and for server i they are exponential random variables with parameter > 0, i = 1, . . ., N. The 
routing variables are independent of each other and of the arrivals and service times. We denote 
the probability that a customer leaving server i is routed to queue j by pij, and the probability 
that they exit the system by p 8j o, for i,j = 1, . . ., N. We assume that p 8j o > for at least one 
i = 1,...,N. Moreover, we assume that the sub-stochastic matrix {pij '■ i,j = 1,...,N} is 
irreducible. Thus for every i,j = 1, . . . , N there exists a sequence of indices i = io, i\, . . . , ij = j 
such that Pi k _ lt i k > for k = 1, . . ., J. 

For this process, the vector defined by the number of customers at each node is a jump Markov 
process on . The event of arrival of a customer to node i corresponds to a jump in direction e 4 -, 
routing from node i to node j / i to a jump in direction e 4 -j = ej — e 4 -, and exiting the system from 
node i to a jump in direction — e 4 -. When a customer is routed from a node back to the same node, 
no jump occurs. The jump intensity from x £ 7/.^ in direction e 4 - is therefore a 4 -, i = 1, .. ., N. 
Moreover, if a; £ n fi 1 '-^}' 1 and i £ I c then the jump intensity in direction e 4 -j is (TiPij, while 
in direction — e 4 - it is (T 8 'p 8j o- If i £ I then both jumps have intensity zero. Thus Condition 1 holds. 
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Define 

H+ = {i:ai>0}, H~ = {i : Pl , > 0}, H = {(i, j) : p 8J > 0, i / j}. (27) 

The set V is given by V = {e 8 , i £ ii+} U {-e 8 , i £ if"} U {e 8J , (i, j) £ if}. Given i C {1, N} 
the intensity function is given by r^ et = a 4 -, r/ )ej ■ = (JiPijl{i (j£ I}, ri- £t = a iPifi^-{i $ I}- The 
jump intensities for the controlled processes we consider are given by ui tV = c v ri tV , where c v > 0, 
v £ V are fixed. 

As in Subsection 5.1, our most involved task is to verify Condition 4 for K = {1, . . ., N}. We 
denote cf = c± ei and c 4 -j = c Ci . , for i,j= 1, . . .,N, i / j. For i C {1, W} we have 

N N N 
C v( r I,v ~ r 9,v) v = C hj( a ^Ph3 1 { i i !} - a iPi,i) e i,i ~ C * ( a iPh0 1 i i £ J ) ~ a iPifl) e i 

V i=l 3 = 1 i=\ 

3*i 

N N N 
= C h3 a iPh3 e h3 + C i a iPhO e i (28) 

i=l 3=1 i=l 

,ei 3¥< <ei 

Therefore, we define the SP {(n 4 -, di),i = 1 . . . , q} of Condition 4 by setting q = N , n 4 - = e 4 -, and 

JV 

^' = - ^2 c hj a iPi,j e hj + c i a iPi,0^i (29) 

for i = 1, . . . , N . Part 1 of Condition 4 obviously holds. By the irreducibility assumption, for every 
i = 1, . . . , N there exists j £ {0, 1, -/V}\{i} for which pij > 0. Hence 

N 

(d t , n t ) = Ci,i°iPi,i + c~a t p t)0 > 0, 
and part 2 of Condition 4 holds. To check part 4 of Condition 4, note that 

If the quantity in the last display is non-zero then it points in a direction in d(x), where x £ 
jr{l,...,N},I 

To show regularity of the SP let us state the following result from [9], which is a slight gener- 
alization of a result in [10]. 

Theorem 8 Consider the SP associated with G C M> N and {(di,ni),i = l...,N}, where the 
directions of constraint {d{\ are linearly independent. Let the matrix Q be defined by 

where Sij = lifi=j and equals zero otherwise. If cr(Q) < 1, then the corresponding SM is regular. 
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We show that the directions of constraint di, i = 1, . . . , N defined by (29) are linearly indepen- 
dent. Suppose for some «i, . . . , ajy, one has J2i a i^i = 0- Setting c^- = 1, i = 1, . . . , N, we have by 
(29) that 

N ( N "I N 

< c~a t p tfi + ^ '■ . n l< .; \ e t = ^ a;c;j<7;p;jej. (30) 

i=l ^ j = l ) i,j = l 

Let 

N 

li = c~a t p t)0 + c i,i a iPi,ii i = l,...,N, 

3 = 1 

and note that ji > 0. Let also 

Pi,j = Ci,j<?iPi,j/li, h3 = t,---,N, 

and note that \pij] is a strictly sub-stochastic matrix, since p 8j o > for some i, and hence 1 cannot 
be an eigenvalue. However, with this notation (30) becomes 

N N 
^2a t j t e t = ^2 0i iliPi,3 e 3^ 

i=l 2J = 1 

which implies that a.{ — 0, i = 1, . . . , N. Thus the di are linearly independent. To calculate the g ?J - 
of Theorem 8, note that for i / j, tij) = c ? -j<7 ? -p ? -j, and thus qi^ — 0, while 

1,3 J2k^ t c i,k<JiPi,k + c~a t p tfi ' 

for j / i. From this it is evident that Q = [qij] is strictly sub-stochastic, and therefore cr(Q) < 1. 
Since Theorem 8 holds, so does Condition 4. As discussed prior to Lemma 2 it therefore also holds 
for all K C {1, . . . , N}. 

Before verifying Conditions 2 and 3 we fix Ji' C {1, . . . , N} and consider the corresponding local 
process, which is a jump Markov process on S = Z5+ ,K . The intensity function is given by 

di v = e,-, 

ri, v = { <^iPifi v = -e t , i £ I, (31) 

®iPi,j v = ^ ^ 1 1 

and zero otherwise. It is useful to notice that one also has the following expression for the intensity 
function: for x £ 7/.^.' ix . v £ V, 

r(x, v) = rq, tV l{x + v £ Z5+ ,K }. 

In other words, the process uses the generator of the original queueing model that applies in the 
facet 0, except possibly at points on the "boundary" (i.e., a facet), in which case it simply deletes 
any jumps that would take it outside 7/.^.' ix . 



We need verify Condition 2 only for the full model, and so take K = {1, N}. This condition 

zN 



requires that we connect any two points in S = (in terms of the intensity function being 
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positive) with a linear bound on the length of the connecting sequence. We first show that it holds 
for any two points on any "simplex" of the form {x G : J2i( x i e i) = A}, and then extend to all 
of . Within each simplex, we show the existence of a connecting sequence by first constructing 
one for each two "neighboring points" (i.e., points x, y such that y = x — e t : + ej), and then moving 
between any two points on the simplex along neighboring points. 

Let i,j G {1, . . . , N}, and let i / j. Then by the irreducibility assumption there exists a 
sequence of indices i = io, i\, . . . , ij = j, where J < N , for which Pi k _ lt i k > for k = 1, . . . , J. 
Hence, for any z G 77^ it follows from (31) that the sequence Zk = z + k = 0, 1, . . . , J, is such 
that r(zk, Zk + i — Zk) > for k = 1, . . . , J. Suppose that x, y G 77.^ are such that y = x — e t + e j . If 
we show that z = x — e 4 - G then it will follow that there exists a sequence x = xq, x\, . . . , xj = y, 
Xk G 77.^ . k = 1, . . . , J such that r(xk, Xk+i — Xk) > for k = 1, . . . , J. However, (z, e 8 ) > follows 
from (z, ei) = (y, e 8 ), and the fact that y G 77^ . By induction, if x, y G 77^ are any points such that 
J2i( x i e i) = J2i(lJi e i)i then there exists a connecting sequence whose length is at most \\x — y\\iN. 

Now suppose that x,y G and A= J2i(Vi e i) ~ J2i( x i e i) > 0j an( i recall that aj > for 
some j G {1, . . . , N}. Hence one can consider the sequence x, x + ej, . . . , x + Aej and use the fact 
that all the intensities r(x, ej), . . . , r(x + (A — l)ej, ej) are positive. To construct a sequence from 
a; + Aej to y along which the transition intensities are positive, one can use the argument of the 
last paragraph. For the case where A < 0, recall that pk t o > for some k G {1, . . . , N}. Thus for 
all z G one has r(z + e^, z) > 0. Hence if A < one can first construct a sequence from x to 
y + |A|efc along which the transition intensities are positive, again using the argument of the last 
paragraph, and then move from y + | A|e^ to y. The total length of the sequence from x to y in all 
cases is at most (N + — y\\\, and Condition 2 therefore holds. 

We begin the verification of Condition 3 by rewriting the last equation in (8) in accordance 
with (31). Let the set K C {1, . . . , N} be fixed. Then 

I,v 

N 

i=l IcK igl ICK igl j^i 



For i = 1, . . . , N define 



n = E pi- ( 32 ) 



Then 



N N [ N I 

fi ~ ^2 c t a i e i = t2 T A - c I a iPifi e i + E C h3 a iPhj( e 3 ~ € i) \ ( 33 ) 

8=1 8=1 I J=l I 



N 



8 = 1 



where the d{ are as in (29). If we write D for the matrix (di, . . . , d^) and r for the vector 
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(ti, . . . , tjv) T , then the last display becomes 



Dt 



-/3 + E 



(34) 



8 = 1 



It was proved immediately after the statement of Theorem 8 that D is non-singular. It follows that 
t depends continuously on (3. Further, for v = e 4 -, r v = a 4 -, and for v = — e 4 - or u = e 4 -j, 



r 8' r 0, 



(35) 



Hence f„ depends continuously on r, and part 1 of Condition 3 follows. Note that in addition to 
(34), t is required to satisfy some further conditions, e.g., r 4 - = 1 for i £ K c . Hence for a given (3 
and set {c v } ve y there may be no solution to (8). However, this does not affect the last argument, 
since Condition 3 only refers to properties of the solutions of (8) when they exist. 

Verifying part 2 of Condition 3, which is straightforward but detailed, is postponed to the 
appendix. 

Having verified all assumptions of Theorem 4, we now rephrase it is the context of the Jackson 
network example. Let us return to (34) and recall that 



D = [d hJ ], d itj = {di,ej) 



E^=l Ci,k<7iPi,k + C l OiPifl i = j 



\ Gi,j&iPi,j 

is a non-singular matrix if c~, c 4 -j > 0. In addition, we let 

C + = diag(c+ : i = 1, . . ., N), a = (a u 
Note that (8) corresponds to the following set of conditions: 

pi > o,E/P/ = i, 

T i = J2l:i(£I Pli 

c+ r- > n 

8 ' l i3 1 



(36) 



(37) 



Dt = C + a - (1. 



We then have the following. 



Theorem 9 Let a 4 - > 0, 0{ > 0, pij > 0, i = 1, . . . , N , j = 0,1,..., TV be given, such that 
J2f=oPi,j = I; * = 1 , - - - , ; \pi,j]i,je{i,...,N} ?s irreducible, and a 4 - > 0, pj t o > for some i,j £ 
{1, . . ., N}. Consider the associated Jackson network, as described above. Then the rate function 
for the local model, corresponding to K C {1, . . . , N}, is as follows for fi £ ■ 



N 



L((i) = inf {J2 



8 = 1 



N 



a t i{cf) + T t a t p t)0 £(c t ) + T t a t ^ pij£(c it j) 



j'=i 

3*i 



Pi > 0,E/P/ = !, 

. T i = J2l:i£I Pli 

' c+ r- > 

C 8 ' % J — ' 

Dt = C+a - (3 



where D, C + and a are as in (36) and (37). 
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6 Appendix 



Proof of part 2 of Theorem 2: 

The inequality q%i(y] /3, e) < q n (y; /3, e) holds for any fi £ M w , e > and y £ S' n . Therefore 
lim lim lim lim sup sup g M (y; /3, e) < lim lim lim sup sup q n (y; (3,e). 

M-»oo e-S>0 5-»0 n -»oo {^e^S" : 1 1 y| | < 5} e-S>0 5-»0 n ->oo {^e^S" : 1 1 y| | < 5} 

We show below that for all sufficiently small e > there is n = 77(e) > 0, no = rao(e) < 00, and Mo 
which depends only on (3 such that 

q n (y-J3,t)<q n M(y;P,v) (38) 

for all M > Mo and n > n®. Thus part 2 of Theorem 2 will follow from part 1 of the same theorem. 

Consider the stopping time 

T n = mi{t £ [0,1] : \\S n (t) - t(3\\ > r,} 

(where inf = 00), and let Tjn denote the stopping u-field associated with T n . It follows from (4) 
that 



(y; /3, rj) > inf E n y I ^ £ r{E. n {t),v)l [-^ 



9m 



u n (S n (t),v,t) 



■(«)>«) 



(ft + Ml{T n <ocU, (39) 



where the infimum is over all admissible controls u n and associated controlled processes H ra . The 
inequality is due to the fact that the upper limit of integration 1 has been replaced by T n A 1. Also, 
according to Theorem 1 



q n (y;(3,e) = mfE;\ / *£r(E n (t),v)e 



u n (S n (t),v,t) 



>(t),v) 



dt + g(~ n ;(3,e)\, 



where the infimum is over all admissible controls u n and associated controlled processes H ra . By 
part 1 of Theorem 2 for sufficiently small e > there are 8 = 5(e) > and no = rao(e) < 00 such 
that for all n > no and all y £ S n , \\y\\ < S there exists an admissible control ii n for which 



E-;\ / 5>(E»(t),t;)* 



ii n (s n (t),v,t) 



>(t),v) 



dt + g(S n ; P,e)}< L((3) + 1 = M < 00. 



We will need to define an analogous control problem for processes whose value is specified at 
times s £ [0, 1] (rather than just s = 0). Accordingly, let E™ denote conditioning on H ra (s) = y, 
and define g s (4>] (3, e) to be 00 if \\(f>(t) — t(3\\ > e for some t £ [s, 1], and zero otherwise. Then for 
sufficiently small e > there are 6 = 5(e) > and no = no(e) < 00 such that for all s £ [0, 1], all 
n > no, and all y £ S n , \\y — s(3\\ < 6, there exists an admissible control u n (-, •, •; s, y) for which 



7i n (H n (i), W ,i; S ,y)' 



1 



+ # s (E n ; /3, e) \ < (1 - + 1 < M . 
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Owing to the continuous dependence of the distribution of H ra on the control, u n can be selected 
so that it is measurable in all variables. 

We now define a composite control that will be used in the variational representation for 
q n (y, /3,e). Let r] £ (0,e A S). If u n is an arbitrary admissible control and ii n is as above, then 
let 

( u n (S n (t),v,t) for t £ [0, T" A 1), 

u n (H n (t),v,t;T n ,H n (T n )) for t £ [T n A 1, 1], 



(M) 



This control is exactly the same as u n up until the first time that the controlled process leaves the 
■q— neighborhood of fit. At that time the control ii n takes over, which will keep the process within 
the e— neighborhood of tfi for the remaining time with a total cost (over the remaining time) of less 
than Mo. Now since the controlled process stays within the e— neighborhood of tfi for all t £ [0, 1], 
the variational characterization of q n (y,(3,e) implies 



? n (y;M <inf {E, 



T n Al 



T"A1 



: n (t),v)£ 



u n (S n (t),v,ty 



dt 



*-(t),v] 
(S n (t),v,t;T n ,S n (T n )) 



1 



dt 



Strictly speaking, the control u n is not of feedback form, but instead belongs to the larger class of 
nonanticipating controls. However, using the fact that the infimum over these two classes is the same 
the inequality holds as stated. We next observe that the conditional expectation in the last display 
is bounded above by MoPy{T n < oo}. It follows from (39) that for any given admissible control 
u n the right hand side is a lower bound for the corresponding cost in the variational representation 
for q%j{y] /3, i]), at least if M > Mo. Since the last display holds for all admissible controls u n , (38) 
follows. 
□ 

Proof of part 2 of Condition 3 for the Jackson network: 

We now verify part 2 of Condition 3 in the setting of Subsection 5.2. We first prove the following. 

Lemma 6 Let K C {1, . . - ,N} be fixed. Then given any numbers < r 4 - < 1, i £ K, one can 
associate numbers pi > 0, / C K, J2icK Pi = 1; such that (32) holds for i £ K , and such that if 
i (j£ K then (32) implies r 4 - = 1. 



Proof: Let < r 4 - < 1, i £ K be given. For / C K, define 

ieK\i iei 

Since for any finite set of numbers {a{\i^j one has J2j'cJ^ieJ\J' a i][YiieJ'(^~ = 1> (^^) holds, 
as well as J2icK Pi = 1- If * ^ K, then by the last sentence the right hand side of (32) is one. 
□ 

Suppose we are given a quadruple ({pi}icK, \fv\veVi { c v}veV, P) satisfying (8), with fi £ J^ K ^ K . 
We let Ti, i £ K be defined by (32) and observe that r v , v £ V must then be given by (35). 
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Returning to the notation of (27), for any i G H (i 6 H + , respectively) for which c- = 
(cf = 0), we fix a sequence of indices j = k$ k^' 1 , . . . , = i (i = k^' 1 , . . . , = j), where 

j G H + (j G H~), and {k\_ x ,k 1 ^ G H for £ = 1,...,J~ (£ = It is possible that a 

sequence consists of just one element, in which case the latter requirement is irrelevant. Similarly, 
for any G H for which c 4 -j = 0, we fix a sequence j = k % Q, k\ 3 , . . . , k l j = i along which we 

also have (&^i> ^Y) £ H, £ = 1, . . ., J 8 j. We assume, without loss of generality, that each of the 
above sequences is self-avoiding, namely consists of distinct indices. 

In order to define cf', C{J and t', i,j G {1, . . ., N}, i / j we first introduce some notation. Let 
Wi,j = #{(p,q) : = (k p m q _ 1 ,k p n l q ), some m = 1, . ..,J p , q }, 
W+ 3 = #{p : = (ki%k^), some m = 1, . . . , J+}, 

W i\j = #{P '■ (*>•?') = (fcm-l> fc m P )) SOme m = !) • • J p"}> 

W-=#{p:t = k~' p }. 

The following are solely consequences of the definitions in the last two paragraphs. The first 
equality in each line is due to the fact that summation along all paths can be performed by first 
grouping the contributions of all paths to each index and i, and then summing on these 

indices. The second inequality in each line is due to the cancellation of all the middle terms along 
each path. 

Jp,q 

E w *,i( e i - e i) = E E ( e c q - e k p m q _ 1 ) = E e P~ e ^ 

i,j {p,q)^H: m = l {p,q)^H: 



E^^-^ + E^"^ E Et^-^l + v r £ e *» 

i,j i pes-: I m = l | peH- 

E (e,- - e t ) -J2W+e t = £ IJ^ (c^ - e^, ) - e k++P \ = £ 

i,j i p£H+: m = l J p I p£H+: 

c+=0 J -+- 

Let now k > be a number. If i is such that r 4 - > 0, we let r- = r 4 -, 



/ . I % i3 J 1 tj *)J l i3 

C h3 Ci '3 * K 



and 

_, _ , l{c" = 0} + W+ 

C t =C t + K % —. 

Ti<TiPi,0 
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If i is such that r 4 - = 0, we let t[ = k, 

, _ i{ c . j = } + w,-, j + w+ + w i -. 



c ■ 



and 



Also, for i G we let 



<?iPi,j 
l{c-=0} + W+ 

°iPi,0 



We now show that with the above definitions, (33) still holds true if one replaces (r 4 -, cf, c 4 -j) 
by (r 4 ', c^', c' •), with the same (3. In fact, 



N N ( N | 

E c+'a 8 e 8 + T l\ -c~'(T t p t , e t + ^ c^-p^fo - e t ) \ 
t = l t = l { 3 = 1 J 

N N ( N 

~ E C t a ^ - E T M - c T a iPifi e i + X] C i,j a iPi,j( e 3 ~ e * 
i=l i=l { 3=1 

= J2[l{c+ = 0} + W-]Ke t 

i 

-J2[l{c~ =0} + W+]ne t 

i 



pEH+: peH~: (p,q)eH: 

c +=o c~=o c p,q=° 

~p u ~p u 

+ E Kl i c * + = °} e « - E k1 K = °} e « + E = 0}(e, - e,-) 

i i i,j 

= 0. 

Hence (33) holds for (t- , cf' , c' •) . Based on r 4 ', i = 1,...,N one can now define p^, I C K as 
in Lemma 6, and r' v as in (35), and conclude that the quadruple ({p'j}icKj {r' v }vev, { c ' v }veV, ft) 
satisfies (8). By definition we have that for all k > 0, c v > 0, v G V, and also that as k — > 0, 
t[ —t- Ti, i = 1,...,N, and c' v — > c v for v G {w G V : r v / 0} while remains bounded for 
v G {w £ ^ : f„ = 0}. Consequently, part 2 of Condition 3 holds. 
□ 



Proof of Lemma 2: For x = there is nothing to prove, hence the contrary is assumed throughout 
the proof. By assumption there is K\ < oo such that for all bounded variation ipi,ip2 £ Dq([0, T] : 



sup ||r(^)W - r(^)WII < i<i sup ||^i (t) - Mt)l (40) 

te[o,T] te[o,T] 



38 



Let 

G= H {yeU N :( y , ni )>o}. 

iei(x) 

We first show that existence of solutions to the SP on G follows from existence of solutions to the 
SP on G. Let tp G Dq([0,T] : M> N ) be of bounded variation, and define 

i^wm- ^(o)ii + ||^(o)n + i 

a= max sup . 

*WMte[o,x] {x,ni) 

Let ^(-) = ip(-) + as, and denote by (cp, r]) the solution to the SP for t/> with respect to G and d{, 
i = 1, . . .,q. Let also <^>(-) = cp(-) — ax and fj = tj. The definition of a implies that <p(t) G G for 
£ G [0, T]. Moreover, for i G I(x) and £ G [0, T], {(f>(t), up) = {(f>(t), np). We also show below that for 

i^/(i),te[o,n 

(0(t),n,-)>O. (41) 

It follows from the last two assertions that for all t G [0, T] and i G (<f>(t), np) = if and only 

if (<p(t),n) = 0, and that for all t G [0,T], <p(t) G <9G if and only if <p(t) G dG. Hence (<p,f)) solve 
the SP for ip with respect to G and d;, i G I(x). 

To show that (41) holds for i (j£ I(x), use (40) and the definition of a to conclude 

> ^(o),^)-^!^^)-^^)!! 

= (4,(0),ni) + a(x,m) -KiU(t) - 4>(0)\\ 

> 1- 

This completes the proof of existence of solutions to the SP on G. 

We denote by T (respectively, T) the corresponding SM on G (respectively, G), and prove that 
Lipschitz continuity off follows from (40). Let ?/>i,?/>2 £ Dq([0,T] : M, N ) be of bounded variation, 
and for j = 1, 2 let (cpj, fjj) solve the SP for ipj with respect to G and d;, i G I(x). Define 

a= max max sup — ; — . 

3 = 1,2 i<ZI(x) te [ 0tT ] {x,ni) 

For j = 1,2 let ipj(-) = ipj(-) + ax, (pj(-) = (pj(-) + ax, and 7/j = fjj. For j = 1,2, i (j£ I(x) and 
£ G [0, T] we have 

<0j(*W > <0j(*W + sup ||^(0)|| + 1 

0e[o,x] 

> 1- 

It follows that (f>j (t) G G for j = 1, 2 and £ G [0, T], and moreover, that for alH G I(x), (cpj(t), np) = 
if and only if (cpj(t), np) = 0, and that (f>j(t) G dG if and only if <pj{t) G dG. Because of this, (cpj, rjj) 
solves the SP for ipj with respect to G and d{,i= 1, . . . , q, j = 1, 2. Hence 

sup \\Mt) - Mt)\\ < Ki sup \\^{t) - p, 2 {t)\\, 
te[o,T] te[o,T] 
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and regularity of T follows. 
□ 
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